Pesquisa | Portal Regional da BVS

1.

Long-read sequencing across the C9orf72 'GGGGCC' repeat expansion: implications for clinical use and genetic discovery efforts in human disease.

Ebbert, Mark T W; Farrugia, Stefan L; Sens, Jonathon P; Jansen-West, Karen; Gendron, Tania F; Prudencio, Mercedes; McLaughlin, Ian J; Bowman, Brett; Seetin, Matthew; DeJesus-Hernandez, Mariely; Jackson, Jazmyne; Brown, Patricia H; Dickson, Dennis W; van Blitterswijk, Marka; Rademakers, Rosa; Petrucelli, Leonard; Fryer, John D.

Mol Neurodegener ; 13(1): 46, 2018 08 21.

Artigo em Inglês | MEDLINE | ID: mdl-30126445

RESUMO

BACKGROUND: Many neurodegenerative diseases are caused by nucleotide repeat expansions, but most expansions, like the C9orf72 'GGGGCC' (G4C2) repeat that causes approximately 5-7% of all amyotrophic lateral sclerosis (ALS) and frontotemporal dementia (FTD) cases, are too long to sequence using short-read sequencing technologies. It is unclear whether long-read sequencing technologies can traverse these long, challenging repeat expansions. Here, we demonstrate that two long-read sequencing technologies, Pacific Biosciences' (PacBio) and Oxford Nanopore Technologies' (ONT), can sequence through disease-causing repeats cloned into plasmids, including the FTD/ALS-causing G4C2 repeat expansion. We also report the first long-read sequencing data characterizing the C9orf72 G4C2 repeat expansion at the nucleotide level in two symptomatic expansion carriers using PacBio whole-genome sequencing and a no-amplification (No-Amp) targeted approach based on CRISPR/Cas9. RESULTS: Both the PacBio and ONT platforms successfully sequenced through the repeat expansions in plasmids. Throughput on the MinION was a challenge for whole-genome sequencing; we were unable to attain reads covering the human C9orf72 repeat expansion using 15 flow cells. We obtained 8× coverage across the C9orf72 locus using the PacBio Sequel, accurately reporting the unexpanded allele at eight repeats, and reading through the entire expansion with 1324 repeats (7941 nucleotides). Using the No-Amp targeted approach, we attained > 800× coverage and were able to identify the unexpanded allele, closely estimate expansion size, and assess nucleotide content in a single experiment. We estimate the individual's repeat region was > 99% G4C2 content, though we cannot rule out small interruptions. CONCLUSIONS: Our findings indicate that long-read sequencing is well suited to characterizing known repeat expansions, and for discovering new disease-causing, disease-modifying, or risk-modifying repeat expansions that have gone undetected with conventional short-read sequencing. The PacBio No-Amp targeted approach may have future potential in clinical and genetic counseling environments. Larger and deeper long-read sequencing studies in C9orf72 expansion carriers will be important to determine heterogeneity and whether the repeats are interrupted by non-G4C2 content, potentially mitigating or modifying disease course or age of onset, as interruptions are known to do in other repeat-expansion disorders. These results have broad implications across all diseases where the genetic etiology remains unclear.

Assuntos

Proteína C9orf72/genética , Expansão das Repetições de DNA/genética , Demência Frontotemporal/genética , Análise de Sequência de DNA/métodos , Adulto , Idoso , Feminino , Humanos , Masculino , Técnicas de Amplificação de Ácido Nucleico/métodos

2.

Improving RNA nearest neighbor parameters for helices by going beyond the two-state model.

Spasic, Aleksandar; Berger, Kyle D; Chen, Jonathan L; Seetin, Matthew G; Turner, Douglas H; Mathews, David H.

Nucleic Acids Res ; 46(10): 4883-4892, 2018 06 01.

Artigo em Inglês | MEDLINE | ID: mdl-29718397

RESUMO

RNA folding free energy change nearest neighbor parameters are widely used to predict folding stabilities of secondary structures. They were determined by linear regression to datasets of optical melting experiments on small model systems. Traditionally, the optical melting experiments are analyzed assuming a two-state model, i.e. a structure is either complete or denatured. Experimental evidence, however, shows that structures exist in an ensemble of conformations. Partition functions calculated with existing nearest neighbor parameters predict that secondary structures can be partially denatured, which also directly conflicts with the two-state model. Here, a new approach for determining RNA nearest neighbor parameters is presented. Available optical melting data for 34 Watson-Crick helices were fit directly to a partition function model that allows an ensemble of conformations. Fitting parameters were the enthalpy and entropy changes for helix initiation, terminal AU pairs, stacks of Watson-Crick pairs and disordered internal loops. The resulting set of nearest neighbor parameters shows a 38.5% improvement in the sum of residuals in fitting the experimental melting curves compared to the current literature set.

Assuntos

Biologia Computacional/métodos , Modelos Químicos , RNA/química , Algoritmos , Entropia , Conformação de Ácido Nucleico , Termodinâmica , Temperatura de Transição

3.

Adeno-associated Virus Genome Population Sequencing Achieves Full Vector Genome Resolution and Reveals Human-Vector Chimeras.

Tai, Phillip W L; Xie, Jun; Fong, Kaiyuen; Seetin, Matthew; Heiner, Cheryl; Su, Qin; Weiand, Michael; Wilmot, Daniella; Zapp, Maria L; Gao, Guangping.

Mol Ther Methods Clin Dev ; 9: 130-141, 2018 Jun 15.

Artigo em Inglês | MEDLINE | ID: mdl-29766023

RESUMO

Recombinant adeno-associated virus (rAAV)-based gene therapy has entered a phase of clinical translation and commercialization. Despite this progress, vector integrity following production is often overlooked. Compromised vectors may negatively impact therapeutic efficacy and safety. Using single molecule, real-time (SMRT) sequencing, we can comprehensively profile packaged genomes as a single intact molecule and directly assess vector integrity without extensive preparation. We have exploited this methodology to profile all heterogeneic populations of self-complementary AAV genomes via bioinformatics pipelines and have coined this approach AAV-genome population sequencing (AAV-GPseq). The approach can reveal the relative distribution of truncated genomes versus full-length genomes in vector preparations. Preparations that seemingly show high genome homogeneity by gel electrophoresis are revealed to consist of less than 50% full-length species. With AAV-GPseq, we can also detect many reverse-packaged genomes that encompass sequences originating from plasmid backbone, as well as sequences from packaging and helper plasmids. Finally, we detect host-cell genomic sequences that are chimeric with inverted terminal repeat (ITR)-containing vector sequences. We show that vector populations can contain between 1.3% and 2.3% of this type of undesirable genome. These discoveries redefine quality control standards for viral vector preparations and highlight the degree of foreign products in rAAV-based therapeutic vectors.

4.

The Diversity, Structure, and Function of Heritable Adaptive Immunity Sequences in the Aedes aegypti Genome.

Whitfield, Zachary J; Dolan, Patrick T; Kunitomi, Mark; Tassetto, Michel; Seetin, Matthew G; Oh, Steve; Heiner, Cheryl; Paxinos, Ellen; Andino, Raul.

Curr Biol ; 27(22): 3511-3519.e7, 2017 Nov 20.

Artigo em Inglês | MEDLINE | ID: mdl-29129531

RESUMO

The Aedes aegypti mosquito transmits arboviruses, including dengue, chikungunya, and Zika virus. Understanding the mechanisms underlying mosquito immunity could provide new tools to control arbovirus spread. Insects exploit two different RNAi pathways to combat viral and transposon infection: short interfering RNAs (siRNAs) and PIWI-interacting RNAs (piRNAs) [1, 2]. Endogenous viral elements (EVEs) are sequences from non-retroviral viruses that are inserted into the mosquito genome and can act as templates for the production of piRNAs [3, 4]. EVEs therefore represent a record of past infections and a reservoir of potential immune memory [5]. The large-scale organization of EVEs has been difficult to resolve with short-read sequencing because they tend to integrate into repetitive regions of the genome. To define the diversity, organization, and function of EVEs, we took advantage of the contiguity associated with long-read sequencing to generate a high-quality assembly of the Ae. aegypti-derived Aag2 cell line genome, an important and widely used model system. We show EVEs are acquired through recombination with specific classes of long terminal repeat (LTR) retrotransposons and organize into large loci (>50 kbp) characterized by high LTR density. These EVE-containing loci have increased density of piRNAs compared to similar regions without EVEs. Furthermore, we detected EVE-derived piRNAs consistent with a targeted processing of persistently infecting virus genomes. We propose that comparisons of EVEs across mosquito populations may explain differences in vector competence, and further study of the structure and function of these elements in the genome of mosquitoes may lead to epidemiological interventions.

Assuntos

Imunidade Adaptativa/genética , Aedes/genética , Aedes/imunologia , Animais , Elementos de DNA Transponíveis/genética , Genoma , Mosquitos Vetores/genética , Mosquitos Vetores/imunologia , Interferência de RNA/imunologia , RNA Interferente Pequeno/genética

5.

Complete telomere-to-telomere de novo assembly of the Plasmodium falciparum genome through long-read (>11 kb), single molecule, real-time sequencing.

Vembar, Shruthi Sridhar; Seetin, Matthew; Lambert, Christine; Nattestad, Maria; Schatz, Michael C; Baybayan, Primo; Scherf, Artur; Smith, Melissa Laird.

DNA Res ; 23(4): 339-51, 2016 Aug.

Artigo em Inglês | MEDLINE | ID: mdl-27345719

RESUMO

The application of next-generation sequencing to estimate genetic diversity of Plasmodium falciparum, the most lethal malaria parasite, has proved challenging due to the skewed AT-richness [â¼80.6% (A + T)] of its genome and the lack of technology to assemble highly polymorphic subtelomeric regions that contain clonally variant, multigene virulence families (Ex: var and rifin). To address this, we performed amplification-free, single molecule, real-time sequencing of P. falciparum genomic DNA and generated reads of average length 12 kb, with 50% of the reads between 15.5 and 50 kb in length. Next, using the Hierarchical Genome Assembly Process, we assembled the P. falciparum genome de novo and successfully compiled all 14 nuclear chromosomes telomere-to-telomere. We also accurately resolved centromeres [â¼90-99% (A + T)] and subtelomeric regions and identified large insertions and duplications that add extra var and rifin genes to the genome, along with smaller structural variants such as homopolymer tract expansions. Overall, we show that amplification-free, long-read sequencing combined with de novo assembly overcomes major challenges inherent to studying the P. falciparum genome. Indeed, this technology may not only identify the polymorphic and repetitive subtelomeric sequences of parasite populations from endemic areas but may also evaluate structural variation linked to virulence, drug resistance and disease transmission.

Assuntos

Genoma de Protozoário , Plasmodium falciparum/genética , Telômero/genética , Mapeamento de Sequências Contíguas , Polimorfismo Genético , Análise de Sequência de DNA

6.

DNA methylation on N(6)-adenine in mammalian embryonic stem cells.

Wu, Tao P; Wang, Tao; Seetin, Matthew G; Lai, Yongquan; Zhu, Shijia; Lin, Kaixuan; Liu, Yifei; Byrum, Stephanie D; Mackintosh, Samuel G; Zhong, Mei; Tackett, Alan; Wang, Guilin; Hon, Lawrence S; Fang, Gang; Swenberg, James A; Xiao, Andrew Z.

Nature ; 532(7599): 329-33, 2016 Apr 21.

Artigo em Inglês | MEDLINE | ID: mdl-27027282

RESUMO

It has been widely accepted that 5-methylcytosine is the only form of DNA methylation in mammalian genomes. Here we identify N(6)-methyladenine as another form of DNA modification in mouse embryonic stem cells. Alkbh1 encodes a demethylase for N(6)-methyladenine. An increase of N(6)-methyladenine levels in Alkbh1-deficient cells leads to transcriptional silencing. N(6)-methyladenine deposition is inversely correlated with the evolutionary age of LINE-1 transposons; its deposition is strongly enriched at young (<1.5 million years old) but not old (>6 million years old) L1 elements. The deposition of N(6)-methyladenine correlates with epigenetic silencing of such LINE-1 transposons, together with their neighbouring enhancers and genes, thereby resisting the gene activation signals during embryonic stem cell differentiation. As young full-length LINE-1 transposons are strongly enriched on the X chromosome, genes located on the X chromosome are also silenced. Thus, N(6)-methyladenine developed a new role in epigenetic silencing in mammalian evolution distinct from its role in gene activation in other organisms. Our results demonstrate that N(6)-methyladenine constitutes a crucial component of the epigenetic regulation repertoire in mammalian genomes.

Assuntos

Adenina/análogos & derivados , Metilação de DNA , Epigênese Genética/genética , Células-Tronco Embrionárias Murinas/metabolismo , Adenina/metabolismo , Homólogo AlkB 1 da Histona H2a Dioxigenase , Animais , Diferenciação Celular/genética , Elementos de DNA Transponíveis/genética , DNA Liase (Sítios Apurínicos ou Apirimidínicos)/deficiência , DNA Liase (Sítios Apurínicos ou Apirimidínicos)/genética , DNA Liase (Sítios Apurínicos ou Apirimidínicos)/metabolismo , Elementos Facilitadores Genéticos/genética , Evolução Molecular , Inativação Gênica , Elementos Nucleotídeos Longos e Dispersos/genética , Mamíferos/genética , Camundongos , Células-Tronco Embrionárias Murinas/citologia , Regulação para Cima/genética , Cromossomo X/genética , Cromossomo X/metabolismo

7.

Modified Amber Force Field Correctly Models the Conformational Preference for Tandem GA pairs in RNA.

Aytenfisu, Asaminew H; Spasic, Aleksandar; Seetin, Matthew G; Serafini, John; Mathews, David H.

J Chem Theory Comput ; 10(3): 1292-1301, 2014 Mar 11.

Artigo em Inglês | MEDLINE | ID: mdl-24803859

RESUMO

Molecular mechanics with all-atom models was used to understand the conformational preference of tandem guanine-adenine (GA) noncanonical pairs in RNA. These tandem GA pairs play important roles in determining stability, flexibility, and structural dynamics of RNA tertiary structures. Previous solution structures showed that these tandem GA pairs adopt either imino (cis Watson-Crick/Watson-Crick A-G) or sheared (trans Hoogsteen/sugar edge A-G) conformations depending on the sequence and orientation of the adjacent closing base pairs. The solution structures (GCGGACGC)2 [Biochemistry, 1996, 35, 9677-9689] and (GCGGAUGC)2 [Biochemistry, 2007, 46, 1511-1522] demonstrate imino and sheared conformations for the two central GA pairs, respectively. These systems were studied using molecular dynamics and free energy change calculations for conformational changes, using umbrella sampling. For the structures to maintain their native conformations during molecular dynamics simulations, a modification to the standard Amber ff10 force field was required, which allowed the amino group of guanine to leave the plane of the base [J. Chem. Theory Comput., 2009, 5, 2088-2100] and form out-of-plane hydrogen bonds with a cross-strand cytosine or uracil. The requirement for this modification suggests the importance of out-of-plane hydrogen bonds in stabilizing the native structures. Free energy change calculations for each sequence demonstrated the correct conformational preference when the force field modification was used, but the extent of the preference is underestimated.

8.

Massively parallel RNA chemical mapping with a reduced bias MAP-seq protocol.

Seetin, Matthew G; Kladwang, Wipapat; Bida, John P; Das, Rhiju.

Methods Mol Biol ; 1086: 95-117, 2014.

Artigo em Inglês | MEDLINE | ID: mdl-24136600

RESUMO

Chemical mapping methods probe RNA structure by revealing and leveraging correlations of a nucleotide's structural accessibility or flexibility with its reactivity to various chemical probes. Pioneering work by Lucks and colleagues has expanded this method to probe hundreds of molecules at once on an Illumina sequencing platform, obviating the use of slab gels or capillary electrophoresis on one molecule at a time. Here, we describe optimizations to this method from our lab, resulting in the MAP-seq protocol (Multiplexed Accessibility Probing read out through sequencing), version 1.0. The protocol permits the quantitative probing of thousands of RNAs at once, by several chemical modification reagents, on the time scale of a day using a tabletop Illumina machine. This method and a software package MAPseeker ( http://simtk.org/home/map_seeker ) address several potential sources of bias, by eliminating PCR steps, improving ligation efficiencies of ssDNA adapters, and avoiding problematic heuristics in prior algorithms. We hope that the step-by-step description of MAP-seq 1.0 will help other RNA mapping laboratories to transition from electrophoretic to next-generation sequencing methods and to further reduce the turnaround time and any remaining biases of the protocol.

Assuntos

Sequenciamento de Nucleotídeos em Larga Escala/métodos , RNA/química , Análise de Sequência de RNA/métodos , Sequência de Bases , Dados de Sequência Molecular , RNA/genética , Software

9.

RNAstructure: Web servers for RNA secondary structure prediction and analysis.

Bellaousov, Stanislav; Reuter, Jessica S; Seetin, Matthew G; Mathews, David H.

Nucleic Acids Res ; 41(Web Server issue): W471-4, 2013 Jul.

Artigo em Inglês | MEDLINE | ID: mdl-23620284

RESUMO

RNAstructure is a software package for RNA secondary structure prediction and analysis. This contribution describes a new set of web servers to provide its functionality. The web server offers RNA secondary structure prediction, including free energy minimization, maximum expected accuracy structure prediction and pseudoknot prediction. Bimolecular secondary structure prediction is also provided. Additionally, the server can predict secondary structures conserved in either two homologs or more than two homologs. Folding free energy changes can be predicted for a given RNA structure using nearest neighbor rules. Secondary structures can be compared using circular plots or the scoring methods, sensitivity and positive predictive value. Additionally, structure drawings can be rendered as SVG, postscript, jpeg or pdf. The web server is freely available for public use at: http://rna.urmc.rochester.edu/RNAstructureWeb.

Assuntos

RNA/química , Software , Internet , Conformação de Ácido Nucleico

10.

RNA structure prediction: an overview of methods.

Seetin, Matthew G; Mathews, David H.

Methods Mol Biol ; 905: 99-122, 2012.

Artigo em Inglês | MEDLINE | ID: mdl-22736001

RESUMO

RNA is now appreciated to serve numerous cellular roles, and understanding RNA structure is important for understanding a mechanism of action. This contribution discusses the methods available for predicting RNA structure. Secondary structure is the set of the canonical base pairs, and secondary structure can be accurately determined by comparative sequence analysis. Secondary structure can also be predicted. The most commonly used method is free energy minimization. The accuracy of structure prediction is improved either by using experimental mapping data or by predicting a structure conserved in a set of homologous sequences. Additionally, tertiary structure, the three-dimensional arrangement of atoms, can be modeled with guidance from comparative analysis and experimental techniques. New approaches are also available for predicting tertiary structure.

Assuntos

Conformação de Ácido Nucleico , RNA/química , Análise de Sequência de RNA/métodos , Algoritmos , Sequência de Bases , Dados de Sequência Molecular , RNA/genética , Termodinâmica

11.

TurboKnot: rapid prediction of conserved RNA secondary structures including pseudoknots.

Seetin, Matthew G; Mathews, David H.

Bioinformatics ; 28(6): 792-8, 2012 Mar 15.

Artigo em Inglês | MEDLINE | ID: mdl-22285566

RESUMO

MOTIVATION: Many RNA molecules function without being translated into proteins, and function depends on structure. Pseudoknots are motifs in RNA secondary structures that are difficult to predict but are also often functionally important. RESULTS: TurboKnot is a new algorithm for predicting the secondary structure, including pseudoknotted pairs, conserved across multiple sequences. TurboKnot finds 81.6% of all known base pairs in the systems tested, and 75.6% of predicted pairs were found in the known structures. Pseudoknots are found with half or better of the false-positive rate of previous methods.

Assuntos

Algoritmos , Conformação de Ácido Nucleico , RNA não Traduzido/química , Mycobacterium tuberculosis/química , Mycobacterium tuberculosis/genética , RNA Bacteriano/química , Ribonuclease P/química , Análise de Sequência de RNA

12.

Automated RNA tertiary structure prediction from secondary structure and low-resolution restraints.

Seetin, Matthew G; Mathews, David H.

J Comput Chem ; 32(10): 2232-44, 2011 Jul 30.

Artigo em Inglês | MEDLINE | ID: mdl-21509787

RESUMO

A novel protocol for all-atom RNA tertiary structure prediction is presented that uses restrained molecular mechanics and simulated annealing. The restraints are from secondary structure, covariation analysis, coaxial stacking predictions for helices in junctions, and, when available, cross-linking data. Results are demonstrated on the Alu domain of the mammalian signal recognition particle RNA, the Saccharomyces cerevisiae phenylalanine tRNA, the hammerhead ribozyme, the hepatitis C virus internal ribosomal entry site, and the P4-P6 domain of the Tetrahymena thermophila group I intron. The predicted structure is selected from a pool of decoy structures with a score that maximizes radius of gyration and base-base contacts, which was empirically found to select higher quality decoys. This simple ab initio approach is sufficient to make good predictions of the structure of RNAs compared to current crystal structures using both root mean square deviation and the accuracy of base-base contacts.

Assuntos

Conformação de Ácido Nucleico , RNA/química , Íntrons , Simulação de Dinâmica Molecular , RNA Catalítico/química , RNA de Transferência de Fenilalanina/química , Partícula de Reconhecimento de Sinal/química

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA